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(54) Automated lip sync error correction 



(57) An automated lip sync error corrector embeds 
a unique video source identifier ID into the video signal 
from each of a plurality of video sources. The unique 
video source ID may be in the form of vertical interval 
time code user bits or in the form of a watermark in an 
active video portion of the video signal. When one of the 



video signals is selected, the embedded unique video 
source ID is extracted. The extracted source ID is used 
to access a corresponding delay value for an adjustable 
audio delay device to re-time a common audio signal to 
the selected video signal. A look-up table may be used 
to correlate the unique video source ID with the corre- 
sponding delay value. 
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Description 

BACKGROUND OF THE INVENTION 

[0001] The present invention relates to television 
technology, and more particularly to automated lip sync 
error correction using an embedded video source iden- 
tifier to control an adjustable audio delay. 
[0002] Audio-to-video (A/V) synchronization is not a 
new problem. Television sources (video and audio) most 
often start with their video and audio synchronized such 
that the audio signal associated with the video signal is 
created at the same time. One exception to this is a 
sporting event shot with a long focal-length lens where 
the audio is received at some distance with a directional 
microphone. Wireless cameras are sometimes used 
and possibly field or sub-switched with wired cameras. 
The wireless camera uses a compression coder/decod- 
er that adds video delay relative to the audio and wired 
camera video. Since a separate microphone, not part of 
the wireless camera system, often picks up the audio 
near the source, there is an additional A/V delay when 
the wireless camera is switched in place of the wired 
camera. Additionally the video often undergoes digital 
processing over video frames, making it arrive even lat- 
er relative to the audio. Without adding a separate audio 
delay, the result is the annoying television signal re- 
ferred to as lip-sync error. This is particularly annoying 
when the video or action precedes the audio, as in the 
case described. 

[0003] What is desired is a method for automated cor- 
rection of audio-to-video (A/V) lip sync error where a 
separate microphone is used as the audio source for a 
plurality of video sources, each video source introducing 
a different video processing delay, in order to re-time the 
video and audio signals to a desired relationship. 

BRIEF SUMMARY OF THE INVENTION 

[0004] Accordingly the present invention provides an 
automated lip sync error correction using an embedded 
video source identifier (ID) to control an adjustable audio 
delay. A separate directional microphone acts as an au- 
dio source and a plurality of camera, both wired and 
wireless, act as video sources. The video signal from 
each video source has a source ID embedded in it, ei- 
ther in the form of vertical interval time code user bits or 
in the form of a watermark in the active video portion of 
the video signal, or both. The video sources are coupled 
to a camera switch that selects a particular one of the 
video signals. The source ID is extracted from the se- 
lected one of the video signals and used to access a 
look-up table that associates audio time delay to a pre- 
determined video source delay. The associated audio 
time delay is applied to an adjustable audio delay device 
to re-time the audio signal from the audio source with 
the selected one of the video signals. 
[0005] The objects, advantages and other novel fea- 



tures of the present invention are apparent from the fol- 
lowing detailed description when read in conjunction 
with the appended claims and attached drawing. 

5 BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF 
THE DRAWING 

[0006] The Figure is a block diagram view of a televi- 
sion system having multiple cameras with different vid- 
eo delays and a separate common microphone that pro- 
vides automated lip sync error correction according to 
the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

[0007] Referring now to the Figure an event being tel- 
evised, such as a golf match in this illustration, uses a 
single directional microphone 1 2 as an audio source and 
a plurality of cameras 14, 16 as video sources. The cam- 
eras 14, 16 include both cameras directly wired to a 
camera switch 22 and cameras that are wirelessly cou- 
pled to the camera switch. Each camera 14, 16 intro- 
duces a different amount of video processing delay to 
the video signal relative to the audio signal from the mi- 
crophone 12. In particular the wireless cameras 16 in- 
troduce a greater video delay than the corresponding 
wired cameras 14. The wireless cameras 16 generally 
compression code the video signal, which is then trans- 
mitted via a camera antenna 1 7 wirelessly through a re- 
ceiver antenna 18 to a receiver 20 where the video sig- 
nal is compression decoded. 

[0008] The broadband video signal from each camera 
14, 16 is input to a source identifier (ID) encoder 24 prior 
to being input to the camera switch 22. Each source ID 
encoder 24 may be either a vertical interval time code 
(VITC) encoder or a watermark encoder, or both. The 
VITC encoder encodes the unique source ID for each 
camera 14, 16 via VITC user bits, as is well known in 
the art. The watermark encoder, which is especially use- 
ful where only the active video portion of the video signal 
is transmitted, i.e., in digital video signal format, en- 
codes the unique source ID for each camera 14, 16 via 
a watermark embedded in the active video portion of the 
video signal using standard watermarking techniques, 
such as those described in U.S. Patent No. 6,211,919 
issued April 3, 2001 entitled "Transparent Embedment 
of Data in a Video Signal." The camera switch 22 selects 
one of the source identified video signals for input to a 
decoder/delay device 26 together with the common au- 
dio signal. 

[0009] The decoder/delay device 26 includes a 
source ID decoder 28 into which the selected source 
identified video signal is input and an adjustable audio 
delay device 30 into which the audio signal is input. The 
source ID decoder 28 extracts the source ID from the 
selected source identified video signal. Based on the 
source ID the decoder 28 accesses a look-up table 30 
to select a pre-determined delay value corresponding to 
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the camera identified by such source ID. The delay val- 
ue is output by the decoder 28 to the delay device 30 to 
select the delay to be applied to the audio signal. The 
video and audio signal outputs from the decoder 28 and 
delay device 30 are input to a program uplink or satellite 
television link (STL) 34 in synchronization with each oth- 
er, i.e., the audio is re-timed with the video. The delay 
differences between the cameras is generally fixed and 
known. Therefore the software "look-up" table 32 has 
the known delays for each camera source ID. 
[0010] Thus the present invention provides automat- 
ed lip sync error correction for a television system hav- 
ing a separate audio source for multiple video sources 
by embedding a source ID into each video signal, se- 
lecting one of the video sources for output, extracting 
the source ID from the video signal for the selected video 
source, and switching in the appropriate delay for an ad- 
justable audio delay device depending upon the extract- 
ed source I D, thereby re-timing the audio signal from the 
audio source with the selected video signal. 



Claims 

1. In a television system having a separate audio 
source with a plurality of video sources and a switch 
for selecting one of the video sources for output with 
the audio source, an automated lip sync error cor- 
rector comprising: 

means for embedding a unique source identifi- 
er into a video signal from each of the video 
sources before input to the switch; 
means for extracting the unique source identi- 
fier from the video signal selected by the switch 
for output; and 

means for adjusting a delay for an audio signal 
from the audio source as a function of the ex- 
tracted unique source identifier to re-time the 
audio signal with the selected video signal. 

2. The corrector as recited in claim 1 wherein the ad- 
justing means comprises: 

means for accessing a delay value associated 
with the extracted unique source identifier; and 
means for applying the delay value as the delay 
for the audio signal to re-time the audio signal 
with the selected video signal. 

3. The corrector as recited in claims 1 or 2 wherein the 
unique source identifier comprises vertical interval 
time code user bits. 

4. The corrector as recited in claims 1 or 2 wherein the 
unique source identifier comprises a watermark. 

5. In a television system having a separate audio 



4 

source with a plurality of video sources and a switch 
for selecting one of the video sources for output with 
the audio source, an automated lip sync error cor- 
rector comprising: 

5 

an encoder coupled to each video source for 
embedding a unique source identifier into a vid- 
eo signal from the video source before input to 
the switch; 

10 a decoder coupled to the switch for extracting 

the unique source identifier from the video sig- 
nal selected by the switch for output; and 
an adjustable audio delay device for delaying 
an audio signal from the audio source as a func- 

15 tion of the extracted unique source identifier to 

re-time the audio signal with the selected video 
signal. 

6. The corrector as recited in claim 5 wherein the de- 
20 coder comprises a lookup table having a delay val- 
ue for each unique source identifier, the delay value 
for the selected video signal being accessed and 
applied to the adjustable audio delay device to re- 
time the audio signal with the selected video signal. 

25 

7. The corrector as recited in claims 5 or 6 wherein 
each encoder comprises a vertical interval time 
code encoder for embedding unique vertical inter- 
val time code user bits into the video signal from the 

30 video source. 

8. The corrector as recited in claims 5 or 6 wherein 
each encoder comprises a watermark encoder for 
embedding a watermark into the video signal from 

35 the video source. 

9. In a television system having a separate audio 
source with a plurality of video sources and a switch 
for selecting one of the video sources for output with 

40 the audio source, an automated lip sync error cor- 
rection method comprising the steps of: 

embedding a unique source identifier into a vid- 
eo signal for each video source before input to 
45 the switch; 

extracting the unique source identifier from the 
video signal selected by the switch for output; 
and 

delaying an audio signal from the audio source 
50 as a function of the extracted unique source 

identifier to re-time the audio signal with the se- 
lected video signal. 

10. The method as recited in claim 9 wherein the delay- 
55 jng step comprises the steps of: 

accessing a look-up table for a delay value as- 
sociated with the extracted unique source iden- 
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tifier; and 

applying the delay value to an adjustable audio 
delay device to re-time the audio signal with the 
selected video signal. 

5 

1 1 . The method as recited in claims 9 or 1 0 wherein the 
unique source identifier comprises vertical interval 
time code user bits. 

12. The method as recited in claim 11 wherein the 10 
unique source identifier comprises a watermark. 
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